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Abstract — In the application of linear network coding to 
wireless broadcasting with feedback, we prove that the problem 
of determining the existence of an innovative encoding vector is 
NP-complete when the finite field size is two. When the finite field 
size is larger than or equal to the number of users, it is shown that 
we can always find an encoding vector which is both innovative 
and sparse. The sparsity can be utilized in speeding up the 
decoding process. An efficient algorithm to generate innovative 
and sparse encoding vectors is developed. Simulations show that 
the delay performance of our scheme with binary finite field 
outperforms a number of existing schemes in terms of average 
and worst-case delay. 

I. Introduction 

Linear network coding provides an excellent solution to 
the wireless broadcasting problem in terms of reliability and 
channel utilization [1], [2J. The idea is to send encoded 
packets that are obtained by taking linear combinations over a 
finite field of all the original packets. The encoding vector 
specifies the coefficients for the linear combination and is 
determined by the transmitter An encoded packet together 
with a header which contains the corresponding encoding 
vector is broadcasted to all users. An encoded packet is said to 
be innovative to a user if the corresponding encoding vector is 
not in the subspace spanned by the encoding vectors already 
received by that user It is called innovative if it is innovative 
to all users who have not yet received enough packets for 
decoding. Obviously, if all the encoded packets generated 
by the transmitter for transmission are innovative, the total 
number of packet transmissions for all users to obtain the 
complete set of packets can be minimized. 

To utilize the radio channel efficiently, it is important to 
generate innovative packets. In |l3|, it is shown that if the size 
of the finite field is equal to the number of users, an innovative 
packet can always be found. In |4|, the authors consider a 
system with perfect feedback, in which the transmitter knows 
the status of all users and tries to find an innovative packet 
by a probabilistic algorithm. This approach is shown to be 
rate-optimal if the underlying finite field is sufficiently large. 
The average number of transmissions is analyzed in [5 |. By 
exploiting the feedback information from users, some authors 
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develop algorithms to generate instantly decodable network- 
coded packets in ||6l, 121 so that innovative packets can 
be decoded once the packets are available at the receivers 
without waiting for the complete reception of the full set of 
packets. Randomized broadcast coding without utilizing any 
feedback is analyzed in (H). The computation with large finite 
field may be costly for mobile hand-held devices. In order 
to reduce encoding and decoding complexity, random linear 
network code over the binary field without feeding back what 
the receivers have received is considered in f9l, |TOl. This 
approach lowers computational complexity at the expense of 
larger number of retransmissions. 

When the finite field size is small, an innovative encoding 
vector may not exist. In Section |III1 we prove that the problem 
of determining the existence of innovative encoding vector is 
NP-complete. A related result is also obtained in where 
the broadcast channel is assumed to be noiseless, and the 
problem of minimizing the number of packets required to 
finish off the file transmission with the binary finite field is 
shown to be NP-complete. In Section IV, we show that we 
can always find a sparse and innovative encoding vector with 
at most K non-zero components using a deterministic algo- 
rithm called the cofactor method. In Section |V] the cofactor 
method is compared with some other transmission schemes by 
simulation, for both small and large finite fields. 

II. System Model and Problem Formulation 

We consider a wireless single-hop system consisting of one 
transmitter and K receivers/users. We denote S as the source 
node and Ui as the i-th receiver, where z G {1, 2, . . . , A'}. The 
source node S wants to broadcast a file to all receivers via 
a wireless channel, which is modeled as a broadcast erasure 
channel. The input to the broadcast erasure channel is a q- 
ary alphabet. We will also call a (/-ary alphabet a packet. 
Each receiver successfully receives the transmitted packet with 
probability 1 — Pg, independent of each other, where 
denotes the erasure probability. An erased packet is unrecov- 
erable and discarded, while a successfully received packet is 
assumed to be error-free. We assume that there is a feedback 
channel from each receiver to the source. Upon receiving a 
packet successfully, a receiver sends an acknowledgement to 
the source node. We assume that the feedback channel has no 
delay and no error The source node keeps track of the status 



of each receiver. The transmitted packet is a function of the 
source file and the acknowledgements from the K receivers. 

In this paper, we focus on transmission schemes with linear 
network coding. The alphabet size (7 is a power of prime 
and the alphabet set is identified with the finite field GF{q). 
The file is packetized into N packets. The transmitted packet 
is a linear combination of the N packets, with coefficients 
drawn from GF{q). An encoding vector is an A^-vector whose 
components are the N coefficients used in the generation of a 
transmitted packet. Each user returns an acknowledgement to 
the source node if a packet is received successfully, until he 
has already received N packets whose encoding vectors are 
linearly independent over GF{q). In order to minimize the 
delay of each user, it is crucial to generate encoding vectors 
which are innovative to all users. 

Our objectives are (i) to determine whether an innovative 
encoding vector exists, and (ii) to devise an effective algorithm 
for generating innovative and sparse encoding vectors. 

III. NP-COMPLETENESS IN FINDING AN INNOVATIVE 

Encoding Vector when q = 2 

If the field size q is larger than or equal to K, it is 
known that an innovative encoding can always be found lO. 
Indeed, the number of non-zero encoding vectors which are 
not innovative to user i is equal to g'^S where is the rank of 
the subspace spanned by the encoding vectors already received 
by user i and Vi < N , and hence by the union bound, the 
number of non-zero vectors which are not innovative is at 
most ((/'■i - 1) + - 1) + . . . + (q'^^' - 1). When q>K, 
the number of non-zero and non-innovative encoding vector 
is strictly less than — 1, and hence we can always find an 
innovative packet. 

When the underlying finite field is small, an innovative 
encoding vector may not exist. 

Problem g-IEV: A problem instance consists of K matrices 
Ci over GF{q), i = 1,2, . . . , K, and each matrix has N 
columns. Determine whether there is an TV-dimensional vector 
over GF{q) which is not in the row space of C^, for all i. 

Theorem 1. 2-IEV is NP-complete. 

Proof: The idea is to reduce the 3-SAT problem, well- 
known to be NP-complete |12|, to the 2-IEV problem. Recall 
that the 3-SAT problem is a Boolean satisfiability problem, 
whose instance is a Boolean expression written in conjunctive 
normal form with three variables per clause (3-CNF), and the 
question is to decide if there is some assignment of TRUE and 
FALSE to the variables such that the given Boolean expression 
has a TRUE value. 

Let E he a given Boolean expression with n variables 
xi, . . . , Xn, and m clauses in 3-CNF. We want to reduce the 
3-SAT problem to the 2-IEV problem with N = n + 1 packets 
and K ^ m + 1 users. 

To the i-th clause (i = 1,2, ... ,m), we first construct a 
3 X matrix B^. If the j-th hteral (j = 1,2,3) in the i-th 
clause is Xk, then let the fc-th component in the j-th row of 
Bi be 1, and the other elements be all zero. Otherwise, if the 



j-th literal in the i-th clause is -^Xk, then let the fc-th and 
the {n + l)-st component in the j-th row of B^ be 1, and 
the remaining components be all zero. Let be the matrix 
whose rows form a basis of the orthogonal complement of the 
row space of B^. We will use the fact that a vector v is in the 
row space of Ci if and only if B^v^ — 0. 

Consider an example with n = 4 Boolean variables. From 
the clause -ixi V -1X2 V X3, we get 



B, 



C,; = 



It can be verified that each row in B^ is orthogonal to the rows 
in Ci, i.e., the row space of is the orthogonal complement 
of the row space of B^. 

For the extra user, user m + 1, let B„i+i be the 1 x (n + 1) 
matrix [0„1], where 0„ stands for the 1 x n all-zero vector 
The problem reduction can be done in polynomial time. 

Let X — [xi X2 ■ . ■ Xn] be a Boolean row vector and x = 
[x 1]. Obviously, any solution x to a 3-SAT problem would 
cause the product BjX^ a non-zero vector for j = 1,2, ... ,m 
and [0„l]x^ ^ 0. Therefore x is not in the row space of Cj 
for all j. Hence x is also a solution to the derived 2-IEV 
problem. 

Conversely, any solution to the derived 2-IEV problem also 
yields a solution to the original 3-SAT problem as well. Let 
c = [ci C2 ... c„ Cn+i] G GF{2)"^^ be a solution to the 
derived 2-IEV problem. Note that we must have c„+i = 1 
because of B„,+i. Let i be an integer between 1 and m. Since 
c is not in the row space of C^, the product B^c^ is a non- 
zero vector, for otherwise c would belong to the orthogonal 
complement of B^. Hence, if we assign TRUE to Xk if Ck — 1 
and FALSE to Xk if Ck = 0, for k = 1,2, ... ,n, then the i-th 
clause will have a TRUE value. Since this is true for all i, the 
whole Boolean expression also has a TRUE value. 

The problem 2-IEV is clearly in NP, since it is efficiently 
verifiable. Hence it is NP-complete. ■ 

IV. Generation of Sparse Encoding Vectors 

After receiving N packets whose encoding vectors are 
linearly independent over GF{q), a user can recover the source 
data by solving a system of N linear equations. The standard 
Gaussian elimination requires 0{N^) operations over GF{q). 
One way to reduce the decoding complexity is to choose 
encoding vectors which are sparse. A vector is called w-sparse 
if there are no more than w non-zero components. 

Theorem 1. If q > K , we can find an innovative encoding 
vector which is K-sparse. 

Proof: Suppose that user k, for k = 1,2, ... ,K, has 
received Vk packets whose encoding vectors are linearly in- 
dependent. Let Ck be the rk x N matrix obtained by putting 
together the encoding vectors. We want to find a if-sparse 
innovative encoding vector x = [a;i X2 ... xn]- 

Since is full-rank, we can find rk columns of which 
are linearly independent. Let Ik be a set of indices of 
linear independent columns in C^. For each k, we arbitrarily 
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pick a column whose index is not in Ik- We call this the Suppose that I[ = {1,2}, I2 = = {1,2,3}. We have 
extra column and let I[. be the union of Ik and the index of 
this extra column. The cardinality of I^ is + 1. For each 
k — 1,2, . . . , K, we construct an {rk + 1) x [rk + 1) matrix 
ilk, by first appending the vector x — [xi X2 ... xjv] to the 
bottom of matrix Cfe, and then deleting all columns of the 
resulting matrix except the columns with indices in I^. 

For each k, we compute the + 1 cofactors of the entries 
in the last row of Hfc. Let Xi^. be the variable with largest 
index in the last row of Hfe whose cofactor is non-zero. The 
column indices ii,...,iK so obtained may not be distinct. 
Let J = {ji, j2, ■ ■ ■ ,js} be the set of distinct indices such 
that J = {ii, 12, ... , ix} and ji < j2 < ■ ■ ■ < js- Also, for 
t = 1, 2, . . . , s, we let /Ct be the set of users such that k ^ JCt 
if and only if ik = jt- We remark that J' contains at most K 
distinct indices, i.e., s < K. 

We obtain a iiT-sparse innovative encoding vector as follows. 
First, we set all variables Xi, for i ^ J ,\a zero. Then we assign 
values to Xj^ , xj^ , . . . ,Xj^ sequentially, so that the determinant 
of Hfe is nonzero for all k. For fc e /Ci, the last row of Hfe has 
only one variable, namely Xj-^ , whose value is not yet assigned 
(the rest are all set to zero). The cofactor of Xj-^ in Hfe is non- 
zero. If we expand the determinant of Hfe in the last row, we 
see that the determinant can be written as bi^ xi^ , where hi^ is 
the cofactor of in Hfe. We have a non-zero value if Xj^ is 
non-zero, for all fc G /Ci. We can assign any non-zero element 
of GF{q) to Xj^, and make Hfe non-zero for all k e K,i. 

c,- , have 



Inductively, suppose that the values of Xj^, . . .x 
been assigned. Consider the determinants of Hfe for k G ICt- 
The only variable in the last row of Hfe which has not been 
assigned a value yet is Xj^ . If we expand the determinant on 
the last row, we obtain a linear polynomial in the form of 
Cij. +bi^Xj^, where a^j. is a constant and bi^, is the cofactor of 
in Hfe. There are at most K such degree-one polynomials, 
and thus we can assign a value to Xj^ such that all determinants 
of Hfe are non-zero. Here we have used the assumption that 
q > K. Note that the assignment of Xj^ does not affect the 
determinants of previous users with indices in /Ci U /C2 U • ■ • U 
K-t-i, because jt either does not appear in /Ci U • • • U ICt-i 
or the corresponding cofactor in Hf is equal to zero for £ G 
K.1 U ■ ■ ■ U ICt-i. 

After the end of the process, we have chosen the values 
for xi,X2, ■ ■ ■ ,xn such that |Hfe| is non-zero for all k ~ 
1,2, ... ,K. This encoding vector is innovative to all users 
and contains at most K non-zero components. ■ 

We call the the method described in the proof of Theorem |2] 
the cofactor method. Using the cofactor method, we can 
produce innovative and X-sparse encoding vectors. For the 
decoding, the number of non-zero coefficients in the linear 
system is no more than KN . 

Example 1. When q — K = 2> and = 3, consider 



Ci = [0 1 0] , C2 = 



1 1 
1 1 



, C3- 



1 
2 



In Hi the cofactors of xi and X2 are —1 and respectively. 
Hence, ii = 1. In H2, all cofactors of xi, X2 and 2:3 are 
non-zero. We thus have 12 — 3. In H3, the cofactor of 0:3 is 
nonzero, and so = 3. The index set J' is equal to {1, 3}. The 
two index sets of users are /Ci = {1} and IC2 = {2, 3}. By the 
cofactor method, we first assign to X2. Then we go through 
the variable indices in J in ascending order. For xi, we can 
assign any nonzero value to xi. For example, we pick xi = 1. 
Once Xl and X2 are fixed, we compute the determinants of H2 
and H3, which are —I + X3 and 2x3 respectively. Finally, we 
want to assign a value to 2:3 such that — 1 + 2:3 ^ and 
2x3 7^ 0. The only choice in this example is 2:3 = 2. The 
resulting encoding vector is [10 2]. 

In the cofactor method, the main complexity is related to the 
computation of r + 1 cofactors in an (r + 1) x (r + 1) matrix. 
A straightforward calculation of an r x r determinant requires 
0{r^) arithmetic operations. The calculation of all cofactors in 
a matrix would require 0(r*) operations per each user in each 
step. We can use a more efficient algorithm, called the Bareiss 
algorithm. The number of arithmetic operations over GF{q) 
required in the computation of cofactors per user can be 
reduced to 0{N^). Summing over all K users, the complexity 
for computing all cofactors is 0{KN^). The complexity of 
the rest of the cofactor method is of 0{K^N). The overall 
complexity of the cofactor method is 0{KN^ + K'^N). 
If Jaggi-Sanders algorithm in II 13 I I is applied to solve the 
encoding problem, the complexity is 0{KN^{K + N)). It 
means that the cofactor method is no worse than the algorithm 
in 1 13] in terms of the encoding complexity. But certainly 
the encoding vector produced by Jaggi-Sanders algorithm is 
not sparse. Details on the Bareiss algorithm is given in the 
appendix. 

The cofactor method assumes that q > K. If q < K, the 
cofactor method may fail to find an assignment of the 2;i's 
such that all determinants are non-zero. In that case, we set 
those 2;i's to zero and the encoding vectors so generated may 
not be innovative. But anyway, the returned encoding vector 
is iiT-sparse. Hence we can still apply the cofactor method for 
the case g = 2 to obtain iiT-sparse encoding vectors, which 
are innovative to only a fraction of the K users. 

In lfT4l . the problem of generating the sparest innovative is 
considered, and is shown to be NP-hard when q> K. 

V. Performance Evaluation 

We evaluate the cofactor method via simulations. In the 
simulations, we divide the transmission into two phases. The 
source node first transmits all packets one by one uncoded. 
The K users acknowledge the packets they have successfully 
received. The source node sets up K matrices Cfe, for k = 
1,2 ... ,K. The rows of are the encoding vectors received 
by user k. Since the packets are uncoded in the first phase. 
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Fig. 3. The average total no. of additions vs the number of users (q=2) 
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Fig. 2. The average delay vs the number of users 



Fig. 4. The average total no. of operations vs the number of users (g=101) 



each row of contains exactly one nonzero component. 
We initialize li to be the set of non-zero columns in Ck- In 
the second phase, we transmit the packets using the encoding 
vectors generated by the cofactor method. 

Each simulation points involved 1000 random realizations 
and we assume that iV = 32 and = 0.3. The worst-case 
delay is defined as the average of total number of transmissions 
for S to ensure that all users receive an intact file over 1000 
random realizations. The average delay means the average 
number of transmissions for S so that an intact file can be 
received by a user For the decoding complexity, we count 
the number of additions and multiplications in decoding. In 
our simulations, an addition operation involving two non-zero 
operands is counted. A multiplication operation is counted 
when none of the two operands is 1 or 0. 

Figure [1] shows the worst-case delay performance of our 
system with the cofactor method, the random linear network 
code (RLNC) scheme in which the components are selected 
according to a uniform distribution, the sorted opportunistic 
method (SOM) in |6| and the maximum weight vertex search 
(MWVS) algorithm in JT) for encoding vector generations, 
where both SOM and MWVS generate instantly decodable 
packets. It is found that, for q — 2, the cofactor method always 
performs better than RLNC, SOM and MWVS in terms of the 
worst-case delay. In addition, we also find that the worst-case 



delay performance of the cofactor method with a small finite 
field size (q = 2) is comparable to that of RLNC with a large 
finite field size (q = 101). Next, we consider the average delay 
performance. According to information theory, the best we can 
do is to have N/ (1 — P^) — 45.7 transmissions on average. In 
Figure |2] we observe that both the cofactor method and RLNC 
with large enough finite field size (q = 101) can achieve the 
limit. From the figure, we also see that although all concerned 
methods may not be optimal when q = 2, the cofactor method 
always results in a smaller average delay. 

The decoding algorithms in most of the previous work 
are basically Gauss-Jordan elimination except the instantly 
decodable schemes in f6l, (7). We implement the Gauss- Jordan 
elimination for sparse matrix in our simulation. Note that the 
ii'-sparse property of the cofactor method implies an upper 
bound on the number of non-zero entries in an encoding 
vector. In practice, the average number of non-zero entries 
is significantly less than both K and N even for K > N. As 
a result, significant decoding complexity reduction is expected 
for a system with the cofactor method. Figures |3] and 2] 
show the average total number of operations for all users 
in the system when q = 2 and q = 101, respectively. The 
cofactor method indeed yields significant reduction in both the 
average total number of addition and multiplication operations 
when compared with RLNC. From Figure |3] we observe that. 



with both SOM and MWVS which are instantly decodable, a 
receiver enjoys a low decoding complexity at the expense of 
larger delay. As a result, the cofactor method which always 
generates sparse encoding vectors is a promising choice in 
terms of delay performance and decoding complexity. 

VI. Conclusions 

We devise a cofactor method to generate K-spame encoding 
vector When q > K, it is guaranteed that the resulting 
encoding vector is innovative, and hence the broadcast system 
is delay-optimal. The sparsity can be exploited in devising 
faster decoding algorithm. Simulation result shows that the 
cofactor method outperforms RLNC, SOM and MWVS in 
terms of both the worst-case delay and average delay. On 
the other hand, when q = 2, the problem of determining the 
existence of an innovative encoding vector is NP-complete. 

Appendix 

The Bareiss algorithm is a fraction-free algorithm for com- 
puting determinant lITSl . To illustrate the idea, we apply 
Bareiss algorithm to an n x n matrix M whose elements 
are integers and the last row consists of indeterminates xi, 
X2, ■ ■ ■ ,Xn- the end of the algorithm, the entry in the lower- 
right comer of M is a linear polynomial in xi, X2, - ■ ■ ,Xn, 
and the coefficient of Xi is the corresponding cofactor of Xi in 
the original M. We remark that this is an in-place algorithm, 
and the complexity is in the order of n^. 

Algorithm 1 Bareiss algorithm 

Input: An n x n matrix M. Assume that all principle minors 
of M are nonzero. 

Notation: Let rriij denote the -entry of M, and tooo — 1- 
for fc = 1, . . . , n — 1 do 

Compute <- ""''ZT.TT"' ^ ij = k+l,...,n. 
end for 

Output: Return the (n, ri)-entry of M. 



Example 2. Consider M — 



M 



■ 1 2 3" 

4 5 6 , as an example. 

After the first pass of the for-loop (k = 1), the partial result 
is 

12 3 

4 -3 -6 

.Tl X2 — 2xi X3 — 3X1 



After the end of the algorithm, we have 
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The coefficients of xi, X2 and 2:3 of the polynomial in the 
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(3, 3) -entry ai'e the cofactors 
respectively. 

Furthermore, the algorithm can be run incrementally. Sup 
pose that only the first r rows in a matrix C is available. 
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The entries marked by are not known yet and will be 
revealed later. We can apply the Bareiss algorithm to the 
submatrix obtained by removing the "*" entries and the right 
n — r columns. When the value of the (7- + l)-st row is known, 
we can run the Bareiss algorithm again on the submatrix 
obtained by removing rows r + 2 to n — 1 and the n — r — 1 
columns on the right. We can see that the entries in the first 
r rows and the first r columns are the same as before and we 
do not need to re-calculate them. Only the calculation of the 
2r + 1 new entries are required. For each user, the source node 
essentially runs the Bareiss algorithm on an x matrix, and 
the complexity per user is 0{N^). Summing over all users, 
the complexity involving the computation of the cofactors is 
0{KN^). 
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